Abstract

We give an approximation algorithm for non-uniform sparsest cut with the following guarantee:
For any $\varepsilon, \delta \in (0,1)$, given cost and demand graphs with edge weights
$C, D : \binom{V}{2} \to \mathbb{R}_+$ respectively, we can find a set $T \subseteq V$ with
$\frac{C(T, V \setminus T)}{D(T, V \setminus T)}$ at most $\frac{1+\varepsilon}{\delta}$ times the optimal non-uniform
sparsest cut value, in time $2^{r/(\delta\varepsilon)} \mathrm{poly}(n)$ provided $\lambda_r \ge \Phi^*/(1-\delta)$. Here $\lambda_r$ is the $r$'th smallest
generalized eigenvalue of the Laplacian matrices of cost and demand graphs; $C(T, V \setminus T)$ (resp.
$D(T, V \setminus T)$) is the weight of edges crossing the $(T, V \setminus T)$ cut in the cost (resp. demand) graph
and $\Phi^*$ is the sparsity of the optimal cut. In words, we show that the non-uniform sparsest
cut problem is easy when the generalized spectrum grows moderately fast. To the best of our
knowledge, there were no results based on higher order spectra for non-uniform sparsest cut
prior to this work.

Even for uniform sparsest cut, the quantitative aspects of our result are somewhat stronger
than previous methods. Similar results hold for other expansion measures like edge expansion,
normalized cut, and conductance, with the $r$'th smallest eigenvalue of the normalized Laplacian
playing the role of $\lambda_r(G)$ in the latter two cases.

Our proof is based on an $\ell_1$-embedding of vectors from a semi-definite program from the
Lasserre hierarchy. The embedded vectors are then rounded to a cut using standard threshold
rounding. We hope that the ideas connecting $\ell_1$-embeddings to Lasserre SDPs will find other
applications. Another aspect of the analysis is the adaptation of the column selection paradigm
from our earlier work on rounding Lasserre SDPs [GS11] to pick a set of edges rather than
vertices. This feature is important in order to extend the algorithms to non-uniform sparsest
cut.
1 Introduction

The problem of finding the sparsest cut in a graph is a fundamental optimization problem that has
been intensively studied. The problem is inherently interesting, and is important as a building
block for divide-and-conquer algorithms on graphs as well as for many applications such as image
segmentation [SM00, SG07], VLSI layout [BL84], packet routing in distributed networks [AP90],
etc.

Let us define the prototypical sparsest cut problem more concretely. We are given a set of $n$
vertices, $V$, along with two functions $C, D : \binom{V}{2} \to \mathbb{R}_+$ representing edge weights of some cost and
demand graphs, respectively. Then given any subset $T \subset V$, we define its sparsity as the following
ratio:

$$\Phi_T := \frac{\sum_{u<v} C_{u,v} \, |\mathbb{1}_T(u) - \mathbb{1}_T(v)|}{\sum_{u<v} D_{u,v} \, |\mathbb{1}_T(u) - \mathbb{1}_T(v)|} \qquad (1)$$

where $\mathbb{1}_T$ is the indicator function of $T$. Our goal in the SPARSEST CUT problem is to find a subset
$T \subset V$ with minimum sparsity, which we denote by $\Phi^* := \min_{T \subset V} \Phi_T$. The special case of the demand
graph being a clique, where the denominator of eq. (1) becomes $|T| \cdot |V \setminus T|$, is called the UNIFORM
SPARSEST CUT problem.
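
As a small illustration of eq. (1), the following sketch evaluates the sparsity of a cut and finds $\Phi^*$ by exhaustive search on a toy instance. The dictionary representation of $C$ and $D$, the function names, and the convention of returning infinity for a cut that separates no demand are assumptions made for this example only; the exhaustive search is exponential in $|V|$ and is not the algorithm of this paper.

```python
from itertools import combinations

def sparsity(T, C, D):
    """Phi_T of eq. (1): cost weight crossing the cut over demand weight crossing it.

    C and D map frozenset({u, v}) to a non-negative weight (assumed encoding).
    """
    T = set(T)
    cut_cost = sum(w for e, w in C.items() if len(e & T) == 1)
    cut_demand = sum(w for e, w in D.items() if len(e & T) == 1)
    if cut_demand == 0:
        return float("inf")  # convention chosen here for cuts separating no demand
    return cut_cost / cut_demand

def brute_force_sparsest_cut(V, C, D):
    """Try every proper non-empty subset of V; only feasible for tiny instances."""
    V = list(V)
    best_phi, best_T = float("inf"), None
    for k in range(1, len(V)):
        for T in combinations(V, k):
            phi = sparsity(T, C, D)
            if phi < best_phi:
                best_phi, best_T = phi, set(T)
    return best_phi, best_T

# Toy instance: the cost graph is a 4-cycle, the demand graph is a clique
# (so this is an instance of UNIFORM SPARSEST CUT).
V = [0, 1, 2, 3]
C = {frozenset(e): 1.0 for e in [(0, 1), (1, 2), (2, 3), (3, 0)]}
D = {frozenset(e): 1.0 for e in combinations(V, 2)}
phi_star, T_star = brute_force_sparsest_cut(V, C, D)
```

On this instance the best cut splits the cycle into two adjacent pairs: two cost edges cross it and $|T| \cdot |V \setminus T| = 4$, so $\Phi^* = 1/2$.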

The value of the sparsest cut can be understood in terms of the spectral properties of cost and
demand graphs. Let $0 \le \lambda_1 \le \lambda_2 \le \cdots \le \lambda_m$ be the generalized eigenvalues between the Laplacian
matrices of cost and demand graphs (see Section 2 for formal definitions). In a way similar to
the "easy" direction of Cheeger's inequality, we can use the Courant-Fischer Theorem to show that
$\lambda_1 \le \Phi^*$. At some point, the eigenvalue $\lambda_r$ will exceed $\Phi^*$. Our main result is an approximation
algorithm for SPARSEST CUT which is efficient when this happens for small $r$. In particular:

Corollary 1 (See Corollary 23). Given $V$ and $C, D : \binom{V}{2} \to \mathbb{R}_+$, for any positive integer $r$, one of the
following holds.

• Either one can find $T \subseteq V$ with $\Phi_T \le 2\Phi^*$ in time $2^{O(r)} \mathrm{poly}(n)$ where $n = |V|$,

• Or $\Phi^* \ge 0.49 \lambda_r$.

Our actual approximation guarantee is stronger and is stated in Corollary 23 (the above follows 
as a corollary with suitable choice of parameters). We can also get similar results for various 
expansion problems such as normalized cut, edge expansion and conductance using the same 
algorithm. 

1.1 Previous approximation algorithms for sparsest cut 

As the (UNIFORM) SPARSEST CUT problem and closely related variants (such as edge expansion
and conductance) are all NP-hard in general, much theoretical effort has gone into the design of
good approximation algorithms for the problem.

For the UNIFORM SPARSEST CUT problem, the hard direction of Cheeger's inequality shows that one
can "round" the eigenvector corresponding to $\lambda_1$ to a cut $T$ satisfying $\Phi_T \le \sqrt{8 d_{\max} \lambda_1(G)}$ where
$d_{\max}$ is the maximum degree. This gives an $O(\sqrt{d_{\max}/\Phi^*(G)}) \le O(\sqrt{d_{\max}/\lambda_1(G)})$ approximation,
which is good for moderate values of $\Phi^*$ in the case of UNIFORM SPARSEST CUT. To the best of
our knowledge, no analogue of this result is known for SPARSEST CUT.

For smaller values of $\Phi^*$, the best approximation for SPARSEST CUT is based on solving a convex
relaxation of the problem, and then rounding the solution to a cut. Using linear programming
(LP), in a seminal work, Leighton and Rao [LR88] gave a factor $O(\log n)$ approximation for
SPARSEST CUT (here $n$ denotes the number of vertices). Beautiful connections of approximating sparsest
cut to embeddings of metric spaces into the $\ell_1$-metric were later discovered in [LLR95, AR98].
Using a semi-definite programming (SDP) relaxation, the approximation ratio was improved to
$O(\sqrt{\log n})$ for UNIFORM SPARSEST CUT in the breakthrough work [ARV09]. For SPARSEST CUT,
using $\ell_1$ embeddings of negative type metrics, an approximation factor of $O(\log^{3/4} n)$ was
obtained in [CGR08] and a factor $O(\sqrt{\log n} \log\log n)$, nearly matching the UNIFORM SPARSEST CUT
case, was obtained in [ALN08].

Recently, higher order eigenvalues were used to approximate many graph partitioning problems.
In [GS11], we gave an algorithm based on SDPs from the Lasserre hierarchy achieving an
approximation factor of the form $(1+\varepsilon)/\min\{1, \lambda_r\}$ for problems such as minimum bisection,
small set expansion, etc., where $\lambda_r$ is the $r$'th smallest eigenvalue of the normalized Laplacian. On
a similar front, for the UNIFORM SPARSEST CUT problem, if the $r$'th eigenvalue is large relative to
expansion, we show how to combine the eigenspace enumeration of [ABS10] with a cut improvement
procedure from [AL08]¹ to obtain a constant factor approximation for UNIFORM SPARSEST
CUT in time $n^{O(1)} 2^{O(r)}$. The details of this combination are briefly spelled out in Appendix B. We
will revisit this approach in Section 1.2 to show why it does not work for SPARSEST CUT.

A common theme in this line of work is that one can obtain a constant factor approximation
with running time being a function of how fast the spectrum grows (both our algorithms in this
paper and the ones in [GS11] in fact allow approximation schemes). Put differently, one can
identify a generic condition which highlights what kind of graphs are easy.

To the best of our knowledge, in the case of SPARSEST CUT with an arbitrary demand graph,
no results in the above vein are known. In fact, we are not aware of an analog of the harder
direction of Cheeger's inequality, let alone spectrum based approximation schemes. In this paper,
we present such an approximation scheme based on the generalized eigenvalues.

1.2 Overview of Our Contributions 

In this section, we briefly describe our main contributions in terms of algorithmic tools and
techniques over similar algorithms such as [GS11].

Main Contributions. Our algorithm is based on solving one of the strongest known SDP relaxations,
$r$-rounds of the Lasserre Hierarchy, similar to [GS11]. Any solution for this SDP yields a vector
for each $r$-subset of vertices and each possible labeling of them. The rounding algorithm in [GS11]
is based on choosing a set of $r$ nodes, "seeds", then labeling these using the SDP solution. Finally
these labels are "propagated" to other vertices independently at random. Such rounding is
acceptable for constraint satisfaction type problems such as maximum cut.

Unfortunately for problems such as Sparsest Cut, independent rounding is too "crude": 
It tends to break the graph into many disconnected components, which is rather disastrous for 
Sparsest Cut. 

In this paper, we consider a more "delicate" rounding based on thresholding. Our main
contribution is to show how the performance of such rounding is related to some strong geometrical
quantities of the underlying SDP solution, and we show how to bound it using generalized spectra.

Comparison with Subspace Enumeration. One successful technique for designing approximation
algorithms based on higher order spectrum is subspace enumeration [Kol10, ABS10]. Suppose we
have a target set $T$ corresponding to a UNIFORM SPARSEST CUT. These techniques rely on the
fact that the indicator vector of $T$ should have a large component on the span of small eigenvectors.
Thus by enumerating over the vectors in this subspace using some $\varepsilon$-net, we can find a set whose
symmetric difference with $T$ is small. Combining this with a cut improvement algorithm due
to [AL08], we can obtain an approximation algorithm for the UNIFORM SPARSEST CUT problem with
slightly worse approximation factors than ours (see Appendix B).

1 We thank anonymous reviewers for bringing this paper to our attention. 
Unfortunately the immediate extension of this approach to SPARSEST CUT by using the
generalized eigenvectors does not work, as the generalized eigenvectors are not orthogonal in the
Euclidean space.

2 Preliminaries 

Sets. Let $[m] := \{1, 2, \ldots, m\}$. Given set $A$ and positive integer $k$, we use $\binom{A}{k}$ (resp. $\binom{A}{\le k}$) to denote
the set of all possible size $k$ (resp. size at most $k$) subsets of $A$. We use $\mathbb{R}_+$ to denote the set of
non-negative reals.

Euclidean Space. Given $A \subseteq \mathbb{R}$ and row set $B$, we use $A^B$ to denote the set of vectors where
each row (axis) is associated with an element of $B$, and each coordinate is chosen from $A$. For
any vector $x \in A^B$, its coordinate at axis $b \in B$ is denoted by $x(b)$. Let $\|x\|_p$ be its $p$'th norm with
$\|x\| := \|x\|_2$, and $x^T$ be its transpose. Finally for any $x, y \in \mathbb{R}^B$, let $\langle x, y \rangle := x^T y$ be their inner
product $\sum_{b \in B} x_b y_b$.

Matrices. Given $A \subseteq \mathbb{R}$, row set $B$ and column set $C$, we use $A^{B,C}$ to denote the set of matrices
with rows and columns associated with elements of $B$ and $C$, respectively. Given matrix $X \in A^{B,C}$,
for any $b \in B$, $c \in C$, we will use $X_{b,c} \in A$ to denote the entry of $X$ at row $b$ and column $c$. For
convenience, we use $X_c \in A^B$ to denote the vector corresponding to the column $c$ of $X$. Likewise
given a subset of columns of $X$, $S \subseteq C$, we use $X_S \in A^{B,S}$ to denote the matrix corresponding to
the columns $S$ of $X$. Given matrix $X$, we use $\|X\|_F$, $\mathrm{Tr}(X)$ and $X^T$ to denote the Frobenius norm of
$X$, its trace and transpose.

Finally we use $X^\Pi$ and $X^\perp$ to denote the projection matrices onto the span of $X$ and its
orthogonal complement.

Positive Semi-Definite (PSD) Ordering. Given a symmetric matrix $X \in \mathbb{R}^{A,A}$, we say $X$ is a PSD
matrix, denoted by $X \succeq 0$, iff $y^T X y \ge 0$ for all $y \in \mathbb{R}^A$.

Eigenvalues. Given symmetric matrix $X \in \mathbb{R}^{A,A}$, for any integer $i \le |A|$, we define its $i$'th smallest
and largest eigenvalues as the following, respectively:

$$\lambda_i := \max_{\mathrm{rank}(Z) \le i-1} \ \min_{w \perp Z,\, w \neq 0} \frac{w^T X w}{w^T w}, \qquad \sigma_i := \min_{\mathrm{rank}(Z) \le i-1} \ \max_{w \perp Z,\, w \neq 0} \frac{w^T X w}{w^T w}.$$

Generalized Eigenvalues. Given two symmetric matrices $X, Y \in \mathbb{R}^{A,A}$ with $Y \succeq 0$, for any integer
$i \le \mathrm{rank}(Y)$, we define their $i$'th smallest generalized eigenvalue as the following:

$$\lambda_i := \max_{\mathrm{rank}(Z) \le i-1} \ \min_{w \perp Z,\, Yw \neq 0} \frac{w^T X w}{w^T Y w}.$$

Graphs. We assume all graphs are simple, undirected and edge-weighted with non-negative weights.
We associate each graph with its edge weight function of the form $W : \binom{V}{2} \to \mathbb{R}_+$, where we use
$W_{u,v}$ to denote the weight of the edge between $u$ and $v$ for convenience.

Laplacian Matrices. Given a graph with weights $W : \binom{V}{2} \to \mathbb{R}_+$, the associated graph Laplacian
matrix, $L_W \in \mathbb{R}^{V,V}$, is defined as the following symmetric matrix:

$$(L_W)_{a,b} := \begin{cases} \sum_c W_{a,c} & \text{if } a = b, \\ -W_{a,b} & \text{if } a \neq b. \end{cases}$$

The following identity is trivial, from which we can easily see that $L_W \succeq 0$:

$$X \in \mathbb{R}^{\Upsilon, V} \implies \mathrm{Tr}\left[X^T X L_W\right] = \sum_{u<v} W_{u,v} \|X_u - X_v\|^2.$$
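
The Laplacian identity above is easy to check numerically. The sketch below is only an illustration under assumed names (`laplacian`, a random symmetric weight matrix `W`, a random matrix `X` whose column `u` plays the role of $X_u$); it also shows that for a single 0/1 row, i.e. an indicator vector, the quadratic form equals the weight of edges crossing the cut, which is how the numerator and denominator of eq. (1) arise from $L_C$ and $L_D$.

```python
import numpy as np

def laplacian(W):
    """Graph Laplacian of a symmetric non-negative weight matrix with zero diagonal."""
    return np.diag(W.sum(axis=1)) - W

rng = np.random.default_rng(0)
n, dim = 5, 3

# Random symmetric weights with zero diagonal (an assumed toy graph).
W = np.triu(rng.random((n, n)), k=1)
W = W + W.T
L = laplacian(W)

# Column u of X is the vector X_u.
X = rng.standard_normal((dim, n))
lhs = np.trace(X.T @ X @ L)
rhs = sum(W[u, v] * np.sum((X[:, u] - X[:, v]) ** 2)
          for u in range(n) for v in range(u + 1, n))

# Special case: X is the indicator row of T = {0, 1}; the quadratic form
# is then the total weight of edges crossing the cut (T, V \ T).
ind = np.array([1.0, 1.0, 0.0, 0.0, 0.0])
quad_form = ind @ L @ ind
cut_weight = W[:2, 2:].sum()

# Positive semi-definiteness follows from the identity; check it numerically.
min_eig = np.linalg.eigvalsh(L).min()
```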
2.1 Lasserre Hierarchy

We present the formal definitions of the Lasserre Hierarchy of SDP relaxations [Las02], tailored to
the setting of the problems we are interested in, where the goal is to assign to each vertex / variable
from $V$ a label from $\{0, 1\}$.

Definition 2 (Lasserre vector set). Given a set of variables $V$ and a positive integer $r$, a collection of
vectors $x$ is said to satisfy $r$-rounds of Lasserre Hierarchy, denoted by $x \in \mathrm{Lasserre}_r(V)$, if it satisfies the
following conditions:

1. For each set $S \in \binom{V}{\le r+1}$, there exists a function $x_S : \{0,1\}^S \to \mathbb{R}^\Upsilon$ that associates a vector of some
finite dimension $\Upsilon$ with each possible labeling of $S$. We use $x_S(f)$ to denote the vector associated with
the labeling $f \in \{0,1\}^S$. For singletons $u \in V$, we will use $x_u$ and $x_u(1)$ interchangeably.

For $f \in \{0,1\}^S$ and $v \in S$, we use $f(v)$ as the label $v$ receives from $f$. Also given sets $S$ with labeling
$f \in \{0,1\}^S$ and $T$ with labeling $g \in \{0,1\}^T$ such that $f$ and $g$ agree on $S \cap T$, we use $f \circ g$ to denote
the labeling of $S \cup T$ consistent with $f$ and $g$: If $u \in S$, $(f \circ g)(u) = f(u)$ and vice versa.

2. $x_\emptyset \neq 0$.

3. $\langle x_S(f), x_T(g) \rangle = 0$ if there exists $u \in S \cap T$ such that $f(u) \neq g(u)$.

4. $\langle x_S(f), x_T(g) \rangle = \langle x_A(f'), x_B(g') \rangle$ if $S \cup T = A \cup B$ and $f \circ g = f' \circ g'$.

5. For any $u \in V$, $\sum_{j \in \{0,1\}} \|x_u(j)\|^2 = \|x_\emptyset\|^2$.

6. (implied by above constraints) For any $S \in \binom{V}{\le r+1}$, $u \in S$ and $f \in \{0,1\}^{S \setminus \{u\}}$,
$\sum_{g \in \{0,1\}^{\{u\}}} x_S(f \circ g) = x_{S \setminus \{u\}}(f)$.

3 Our Algorithm and Its Analysis 

The complete algorithm is presented in Algorithm 3. It is based on rounding a certain $r'$-rounds
of Lasserre Hierarchy relaxation for the SPARSEST CUT problem given positive integer $r'$:

$$\min \frac{\sum_{u<v} C_{u,v} \|x_u - x_v\|^2}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2} \quad \text{s.t.} \quad \sum_{u<v} D_{u,v} \|x_u - x_v\|^2 > 0, \ x \in \mathrm{Lasserre}_{r'}(V), \ \|x_\emptyset\|^2 = 1. \qquad (2)$$

It is easy to see that eq. (2) is indeed a relaxation of the SPARSEST CUT problem. Even though it is not
an SDP problem (it is quasi-convex), there is an equivalent SDP formulation.

Lemma 3. The following SDP is equivalent to eq. (2):

$$\min \sum_{u<v} C_{u,v} \|w_u - w_v\|^2 \quad \text{s.t.} \quad \sum_{u<v} D_{u,v} \|w_u - w_v\|^2 = 1, \ \|w_\emptyset\|^2 > 0, \ w \in \mathrm{Lasserre}_{r'}(V). \qquad (3)$$

Remark 4. The constraint $\|w_\emptyset\|^2 > 0$ in eq. (3) is redundant, but we included it for the sake of clarity.

Proof of Lemma 3. Given a feasible solution $x$ of eq. (2), consider the following collection of vectors,
$w = [w_T]_{T \in \binom{V}{\le r'+1}}$. For each $T \in \binom{V}{\le r'+1}$, we define $w_T$ as
$w_T := \frac{1}{\sqrt{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2}} \, x_T$. It is easy to see
that $\sum_{u<v} D_{u,v} \|w_u - w_v\|^2 = 1$ and the objective values are equal. Finally
$\|w_\emptyset\|^2 = \frac{1}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2} > 0$
since $0 < \sum_{u<v} D_{u,v} \|x_u - x_v\|^2 < +\infty$.

For the other direction of equivalence, suppose $w$ is a feasible solution of eq. (3). For each
$T \in \binom{V}{\le r'+1}$, $f \in \{0,1\}^T$, let $x_T(f) \leftarrow \frac{1}{\|w_\emptyset\|} w_T(f)$. It is easy to see that the objective values are equal.
The rest of the proof, that $x$ is a feasible solution of eq. (2), follows in the same way as in the previous
direction. □
Remark 5. Main components of our rounding, Algorithms 1 and 2, are scale invariant; thus the
formulation given in eq. (3) is sufficient for rounding purposes. But we chose to first present eq. (2) as it is more
intuitive.



3.1 Intuition Behind Our Rounding 



For an intuition behind our rounding procedure, presented in Algorithm 1, we start with a simple 
rounding procedure, which is based on the seed based propagation framework from [GS11]. Later 
in this section, we will show how to fix it. 

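First Attempt. Consider the following procedure. On input $x \in \mathrm{Lasserre}_{2r'+2}(V)$:

1. Choose a set of $r'$ edges from the demand graph, say $\mathcal{S} \subseteq \binom{V}{2}$ (seed edges).

2. Let $\widetilde{S}$ be the set of their endpoints, $\widetilde{S} \leftarrow \{u \in V \mid \text{exists } v \text{ such that } \{u, v\} \in \mathcal{S}\} \subseteq V$.

3. Observe that $|\widetilde{S}| \le 2r'$, hence the values $\|x_{\widetilde{S}}(f)\|^2$ define a probability distribution over all
labelings of $\widetilde{S}$, $f : \widetilde{S} \to \{0, 1\}$. So sample a labeling for $\widetilde{S}$, $f : \widetilde{S} \to \{0, 1\}$, with probability
$\|x_{\widetilde{S}}(f)\|^2$.

4. Choose a threshold $\tau \in [0, 1]$ uniformly at random and output the following set:

$$T(f, \tau) := \left\{ u \in V \ \middle|\ \frac{\langle x_{\widetilde{S}}(f), x_u \rangle}{\|x_{\widetilde{S}}(f)\|^2} \ge \tau \right\}.$$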

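In order for this procedure to make sense, the range of $\frac{\langle x_{\widetilde{S}}(f), x_u \rangle}{\|x_{\widetilde{S}}(f)\|^2}$ should be similar to $\tau$'s range. In
the following claim, we prove this.

Claim 6. Provided that $x_{\widetilde{S}}(f) \neq 0$, we have: (i) $0 \le \frac{\langle x_{\widetilde{S}}(f), x_u \rangle}{\|x_{\widetilde{S}}(f)\|^2} \le 1$ for any $u \in V$,
(ii) $\frac{|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|}{\|x_{\widetilde{S}}(f)\|^2} \le 1$ for any pair $u, v \in V$,
(iii) $\frac{\langle x_{\widetilde{S}}(f), x_u \rangle}{\|x_{\widetilde{S}}(f)\|^2} = f(u)$ for any $u \in \widetilde{S}$.

Proof of (i) and (ii). We will only prove (i), from which (ii) follows immediately. The lower bound
follows from $\langle x_{\widetilde{S}}(f), x_u \rangle = \|x_{\widetilde{S} \cup \{u\}}(f \circ 1)\|^2 \ge 0$. For the upper bound, we have:

$$\langle x_{\widetilde{S}}(f), x_{\widetilde{S}}(f) - x_u \rangle = \|x_{\widetilde{S}}(f)\|^2 - \langle x_{\widetilde{S}}(f), x_u \rangle = \langle x_{\widetilde{S}}(f), x_\emptyset \rangle - \langle x_{\widetilde{S}}(f), x_u \rangle$$
$$= \langle x_{\widetilde{S}}(f), x_u(0) \rangle = \|x_{\widetilde{S} \cup \{u\}}(f \circ 0)\|^2 \ge 0. \qquad \square$$

Proof of (iii). Follows from the fact that $\langle x_{\widetilde{S}}(f), x_u(f(u)) \rangle = \|x_{\widetilde{S}}(f)\|^2$. □

Let's calculate the probability of separating two vertices by this procedure.

Claim 7. $\mathbb{E}_{f,\tau}\left[\, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \,\right] = \sum_f |\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|$.

Proof. For fixed $f$, by Claim 6 the probability of separating $u$ and $v$ is equal to
$\frac{|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|}{\|x_{\widetilde{S}}(f)\|^2}$. Taking expectation over $f$:

$$\mathbb{E}_{f,\tau}\left[\, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \,\right] = \sum_f \|x_{\widetilde{S}}(f)\|^2 \cdot \frac{|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|}{\|x_{\widetilde{S}}(f)\|^2} = \sum_f |\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|. \qquad \square$$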
Second Attempt. For any fixed $f : \widetilde{S} \to \{0, 1\}$, there are at most $n$ different $T(f, \tau)$'s. Hence
instead of choosing $f : \widetilde{S} \to \{0, 1\}$ and $\tau \in [0, 1]$ randomly, we can perform an exhaustive search over
all possible such sets and output the one with minimum sparsity. Since there are at most $n 2^{O(r')}$
many unique $T(f, \tau)$'s, the exhaustive search can easily be implemented in time $\mathrm{poly}(n) 2^{O(r')}$. The
rounding procedure along with this modification is presented in Algorithm 1.

3.2 Seed Based $\ell_1$-embedding

We choose our embedding so as to reflect the rounding procedure outlined in the previous section. 

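Definition 8 (Seed Based Embedding). Given $x \in \mathrm{Lasserre}_{2r'+2}(V)$ and $\mathcal{S} \subseteq \binom{V}{2}$ with $|\mathcal{S}| \le r'$, let
$\widetilde{S}$ be the end-points of edges in $\mathcal{S}$ so that $\mathcal{S} \subseteq \binom{\widetilde{S}}{2}$. Then we define the seed based embedding of $x$ as the
following collection of vectors. For each $u \in V$, $y^{\mathcal{S}}_u \in \mathbb{R}^{\{0,1\}^{\widetilde{S}}}$ is given by

$$y^{\mathcal{S}}_u := \left[ \langle x_{\widetilde{S}}(f), x_u \rangle \right]_{f : \widetilde{S} \to \{0,1\}}.$$

Observe that $\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1$ is equal to the probability that $u$ and $v$ are separated as shown
in Claim 7.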

It is well known that once we have an $\ell_1$-embedding, we can get a cut with similar sparsity
by choosing the best threshold cut along each coordinate and this is exactly what we do in
Algorithm 1. See Appendix A for a proof of the following well-known lemma.

Lemma 9 ($\ell_1$ Embeddings and Threshold Cuts [LLR95]). Given a set of vertices $V$, a collection of
vectors $[y_u \in \mathbb{R}^\Upsilon]_{u \in V}$ representing an embedding of $V$, the following holds. For any $C, D : \binom{V}{2} \to \mathbb{R}_+$
being the edge weights of graphs $G$ and $H$, respectively:

$$\min_{f, \tau} \Phi_{T(f,\tau)} \le \frac{\sum_{u<v} C_{u,v} \|y_u - y_v\|_1}{\sum_{u<v} D_{u,v} \|y_u - y_v\|_1}. \qquad (4)$$

Here $T(f, \tau) := \{u \in V \mid y_u(f) \ge \tau\}$ represents the threshold cut along coordinate $f \in \Upsilon$.
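
The statement of Lemma 9 can be sanity-checked numerically. The following sketch, with assumed helper names (`cut_ratio`, `best_threshold_cut`, `l1_ratio`) and random toy data standing in for an actual embedding, tries every threshold cut along every coordinate and compares the best sparsity found with the right-hand side of eq. (4). Cuts that separate no demand are given sparsity infinity, which is a convention chosen for this example.

```python
import numpy as np

def cut_ratio(mask, C, D):
    """Sparsity of the cut given by a boolean mask; inf if no demand is separated."""
    cross = mask[:, None] != mask[None, :]
    demand = np.triu(D * cross, k=1).sum()
    cost = np.triu(C * cross, k=1).sum()
    return cost / demand if demand > 0 else np.inf

def best_threshold_cut(Y, C, D):
    """Y has one row per vertex and one column per coordinate f.

    Tries every cut {u : Y[u, f] >= tau}, with tau ranging over the values
    taken by coordinate f, and returns (sparsity, mask) of the best one.
    """
    best_val, best_mask = np.inf, None
    for f in range(Y.shape[1]):
        for tau in np.unique(Y[:, f]):
            mask = Y[:, f] >= tau
            val = cut_ratio(mask, C, D)
            if val < best_val:
                best_val, best_mask = val, mask
    return best_val, best_mask

def l1_ratio(Y, C, D):
    """Right-hand side of eq. (4) for the embedding Y."""
    dist = np.abs(Y[:, None, :] - Y[None, :, :]).sum(axis=2)
    return np.triu(C * dist, k=1).sum() / np.triu(D * dist, k=1).sum()

rng = np.random.default_rng(1)
n = 6
C = np.triu(rng.random((n, n)), k=1)
C = C + C.T
D = np.triu(rng.random((n, n)), k=1)
D = D + D.T
Y = rng.random((n, 3))          # stand-in embedding, one row y_u per vertex

phi, mask = best_threshold_cut(Y, C, D)
bound = l1_ratio(Y, C, D)
```

Only thresholds equal to a coordinate value need to be tried, since any other threshold yields the same set as the next larger coordinate value; this is the same observation that limits Algorithm 1 to at most $n$ sets per labeling.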

In the rest of this section, we will upper bound eq. (4) for our embedding from Definition 8.

Claim 10. $\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 \le \|x_u - x_v\|^2$.

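Proof. Since $x \in \mathrm{Lasserre}_{2r'+2}(V)$, we can express $x_u$ and $x_v$ as

$$x_u = x_{u,v}(10) + x_{u,v}(11), \quad x_v = x_{u,v}(01) + x_{u,v}(11), \quad \text{so} \quad x_u - x_v = x_{u,v}(10) - x_{u,v}(01).$$

The following identity follows easily²:

$$\|x_u - x_v\|^2 = \|x_{u,v}(10)\|^2 + \|x_{u,v}(01)\|^2 - 2 \underbrace{\langle x_{u,v}(10), x_{u,v}(01) \rangle}_{=0} = \|x_{u,v}(10)\|^2 + \|x_{u,v}(01)\|^2. \qquad (5)$$

Therefore:

$$|y^{\mathcal{S}}_u(f) - y^{\mathcal{S}}_v(f)| = |\langle x_{\widetilde{S}}(f), x_u - x_v \rangle| = |\langle x_{\widetilde{S}}(f), x_{u,v}(10) - x_{u,v}(01) \rangle|$$
$$\le |\langle x_{\widetilde{S}}(f), x_{u,v}(10) \rangle| + |\langle x_{\widetilde{S}}(f), x_{u,v}(01) \rangle|.$$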


² Intuitively, it corresponds to the following. The "probability" that $u$ and $v$ are separated is equal to the probability
of $u$ and $v$ being labeled with 1 and 0, or 0 and 1.
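For any $g : \{u, v\} \to \{0, 1\}$, $\langle x_{\widetilde{S}}(f), x_{u,v}(g) \rangle = \|x_{\widetilde{S} \cup \{u,v\}}(f \circ g)\|^2 \ge 0$. Thus the last bound equals:

$$\langle x_{\widetilde{S}}(f), x_{u,v}(10) \rangle + \langle x_{\widetilde{S}}(f), x_{u,v}(01) \rangle.$$

Summing over $f$ and using the fact that $x_\emptyset = \sum_f x_{\widetilde{S}}(f)$:

$$\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 \le \left\langle \sum_f x_{\widetilde{S}}(f), \ x_{u,v}(10) + x_{u,v}(01) \right\rangle = \langle x_\emptyset, \ x_{u,v}(10) + x_{u,v}(01) \rangle$$
$$= \|x_{u,v}(10)\|^2 + \|x_{u,v}(01)\|^2 = \|x_u - x_v\|^2 \quad \text{by eq. (5).} \qquad \square$$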
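Claim 11. $\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 \ge \sum_{f : x_{\widetilde{S}}(f) \neq 0} \langle \overline{x}_{\widetilde{S}}(f), x_u - x_v \rangle^2$ where
$\overline{x}_{\widetilde{S}}(f) := \frac{x_{\widetilde{S}}(f)}{\|x_{\widetilde{S}}(f)\|}$ is the unit vector for $x_{\widetilde{S}}(f)$.

Proof. For any $f$ with $x_{\widetilde{S}}(f) \neq 0$, by Claim 6, $0 \le \frac{|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|}{\|x_{\widetilde{S}}(f)\|^2} \le 1$, thus
$\frac{|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle|}{\|x_{\widetilde{S}}(f)\|^2} \ge \left( \frac{\langle x_{\widetilde{S}}(f), x_u - x_v \rangle}{\|x_{\widetilde{S}}(f)\|^2} \right)^2$.
Multiplying both sides with $\|x_{\widetilde{S}}(f)\|^2 > 0$, we obtain
$|\langle x_{\widetilde{S}}(f), x_u - x_v \rangle| \ge \frac{\langle x_{\widetilde{S}}(f), x_u - x_v \rangle^2}{\|x_{\widetilde{S}}(f)\|^2} = \langle \overline{x}_{\widetilde{S}}(f), x_u - x_v \rangle^2$.
Summing over all $f$ with $x_{\widetilde{S}}(f) \neq 0$, we obtain the desired lower bound,
$\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 = \sum_f |\langle x_{\widetilde{S}}(f), x_u - x_v \rangle| \ge \sum_{f : x_{\widetilde{S}}(f) \neq 0} \langle \overline{x}_{\widetilde{S}}(f), x_u - x_v \rangle^2$. □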

In its current form, our lower bound is not very useful as it involves the higher order vectors
($x_{\widetilde{S}}(f)$'s) from our relaxation. Unfortunately these vectors are very hard to reason about: We do
not have any direct handle on them. Therefore our goal is to relate this expression to some other
expression that only involves the vectors for edges ($x_u - x_v$'s). We first introduce some notation.

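Notation 12. Let $\Pi_{\widetilde{S}} := \sum_{f : x_{\widetilde{S}}(f) \neq 0} \overline{x}_{\widetilde{S}}(f) \cdot \overline{x}_{\widetilde{S}}(f)^T$.

We can rewrite the lower bound from Claim 11 in terms of $\Pi_{\widetilde{S}}$ as follows:

$$\sum_f \langle \overline{x}_{\widetilde{S}}(f), x_u - x_v \rangle^2 = (x_u - x_v)^T \left( \sum_f \overline{x}_{\widetilde{S}}(f) \cdot \overline{x}_{\widetilde{S}}(f)^T \right) (x_u - x_v) = (x_u - x_v)^T \, \Pi_{\widetilde{S}} \, (x_u - x_v). \qquad (6)$$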

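It was first observed in [GS11] that $\Pi_{\widetilde{S}}$ has a special structure: it is a projection matrix onto the
span of vectors $\{x_{\widetilde{S}}(f)\}_f$.

Proposition 13. $\Pi_{\widetilde{S}}^2 = \Pi_{\widetilde{S}}$, i.e. $\Pi_{\widetilde{S}}$ is a projection matrix onto the span of vectors in $\{x_{\widetilde{S}}(f)\}_f$.

Proof. Observe that $\langle \overline{x}_{\widetilde{S}}(f), \overline{x}_{\widetilde{S}}(g) \rangle = \begin{cases} 1 & \text{if } f = g, \\ 0 & \text{else.} \end{cases}$ Then we have:

$$\Pi_{\widetilde{S}}^2 = \sum_{f,g} \overline{x}_{\widetilde{S}}(f) \, \langle \overline{x}_{\widetilde{S}}(f), \overline{x}_{\widetilde{S}}(g) \rangle \, \overline{x}_{\widetilde{S}}(g)^T = \sum_f \overline{x}_{\widetilde{S}}(f) \, \overline{x}_{\widetilde{S}}(f)^T = \Pi_{\widetilde{S}}. \qquad \square$$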

For each seed edge $\{u, v\} \in \mathcal{S}$, $x_u - x_v \in \mathrm{span}\{x_{\widetilde{S}}(f)\}_f$. This means we can lower bound the
matrix $\Pi_{\widetilde{S}}$ in terms of the projection matrix onto the span of vectors corresponding to seed edges!

Notation 14. Let $P_{\mathcal{S}}$ be the projection matrix onto the span of $\{x_u - x_v\}_{\{u,v\} \in \mathcal{S}}$. Similarly let $P^\perp_{\mathcal{S}}$ be the
projection matrix onto the orthogonal complement of $\{x_u - x_v\}_{\{u,v\} \in \mathcal{S}}$, $P^\perp_{\mathcal{S}} = I - P_{\mathcal{S}}$, where $I$ is the
identity matrix.
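Lemma 15. $\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 \ge \|P_{\mathcal{S}}(x_u - x_v)\|^2 = \|x_u - x_v\|^2 - \|P^\perp_{\mathcal{S}}(x_u - x_v)\|^2$.

Proof. From Claim 11 and eq. (6) we see that $\|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1 \ge (x_u - x_v)^T \Pi_{\widetilde{S}} (x_u - x_v)$. For any $u \in \widetilde{S}$,
$x_u = \sum_{f : f(u) = 1} x_{\widetilde{S}}(f)$, hence $x_u \in \mathrm{span}\{x_{\widetilde{S}}(f)\}_f$. In particular, for any pair $u, v \in \widetilde{S}$:
$x_u - x_v \in \mathrm{span}\{x_{\widetilde{S}}(f)\}_f$, which means:

$$\mathrm{span}\{x_u - x_v\}_{\{u,v\} \in \mathcal{S}} \subseteq \mathrm{span}\{x_u - x_v\}_{u,v \in \widetilde{S}} \subseteq \mathrm{span}\{x_{\widetilde{S}}(f)\}_f \implies \Pi_{\widetilde{S}} \succeq P_{\mathcal{S}} = P_{\mathcal{S}}^2.$$

Consequently, $(x_u - x_v)^T \Pi_{\widetilde{S}} (x_u - x_v) \ge (x_u - x_v)^T P_{\mathcal{S}}^2 (x_u - x_v) = \|P_{\mathcal{S}}(x_u - x_v)\|^2$. □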

We wrap up this section with the following theorem. 

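Theorem 16. Given $x \in \mathrm{Lasserre}_{2r'+2}(V)$ and a set of seed edges $\mathcal{S} \subseteq \binom{V}{2}$ with projection matrices $P_{\mathcal{S}}, P^\perp_{\mathcal{S}}$
as in Notation 14; let $T \subseteq V$ be the set returned by Algorithm 1 and $y^{\mathcal{S}}$ be the embedding as described
in Definition 8. Then the following bounds hold:

$$\frac{\Phi_T}{\Phi^{SDP}} \le \frac{1}{\Phi^{SDP}} \cdot \frac{\sum_{u<v} C_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1}{\sum_{u<v} D_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1} \le \left( 1 - \frac{\sum_{u<v} D_{u,v} \|P^\perp_{\mathcal{S}}(x_u - x_v)\|^2}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2} \right)^{-1} \qquad (7)$$

where $\Phi^{SDP} := \frac{\sum_{u<v} C_{u,v} \|x_u - x_v\|^2}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2}$.

Proof. $\Phi_T \le \frac{\sum_{u<v} C_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1}{\sum_{u<v} D_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1}$ follows from Lemma 9. Claim 10 and Lemma 15 together imply

$$\frac{\sum_{u<v} C_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1}{\sum_{u<v} D_{u,v} \|y^{\mathcal{S}}_u - y^{\mathcal{S}}_v\|_1} \le \frac{\sum_{u<v} C_{u,v} \|x_u - x_v\|^2}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2 - \sum_{u<v} D_{u,v} \|P^\perp_{\mathcal{S}}(x_u - x_v)\|^2}$$
$$= \Phi^{SDP} \left( 1 - \frac{\sum_{u<v} D_{u,v} \|P^\perp_{\mathcal{S}}(x_u - x_v)\|^2}{\sum_{u<v} D_{u,v} \|x_u - x_v\|^2} \right)^{-1}. \qquad \square$$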

3.3 Choosing Seed Edges 

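Notation 17. Given $x = [x_T \in \mathbb{R}^\Upsilon]$ and $D : \binom{V}{2} \to \mathbb{R}_+$, let $X \in \mathbb{R}^{\Upsilon, \binom{V}{2}}$ be the following matrix whose
columns are associated with vertex pairs: $X := \left[ \sqrt{D_{u,v}} \, (x_u - x_v) \right]_{\{u,v\} \in \binom{V}{2}}$.

Observe that $\|X\|_F^2 = \sum_{u<v} D_{u,v} \|x_u - x_v\|^2$. Since $\mathcal{S} \subseteq \binom{V}{2}$, the matrix $X_{\mathcal{S}}$ is well defined.
Moreover there is a strong connection between $X^\Pi_{\mathcal{S}}$ and $P_{\mathcal{S}}$, which we formalize next:

Claim 18. $P_{\mathcal{S}} \succeq (X_{\mathcal{S}})^\Pi$. Furthermore if $\mathcal{S} \subseteq \mathrm{support}(D)$ then $P_{\mathcal{S}} = (X_{\mathcal{S}})^\Pi$.

Proof. Recall that $\mathcal{S} \subseteq \binom{V}{2}$ and $P_{\mathcal{S}}$ represents $\mathrm{span}\{x_u - x_v\}_{\{u,v\} \in \mathcal{S}}$, which contains every column
of $X_{\mathcal{S}} = \left[ \sqrt{D_{u,v}} \, (x_u - x_v) \right]_{\{u,v\} \in \mathcal{S}}$. □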

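In particular, since $\sum_{u<v} D_{u,v} \|P^\perp_{\mathcal{S}}(x_u - x_v)\|^2 = \|P^\perp_{\mathcal{S}} X\|_F^2 \le \|(X_{\mathcal{S}})^\perp X\|_F^2$, the right-hand side of eq. (7) is at most

$$\left( 1 - \frac{\|(X_{\mathcal{S}})^\perp X\|_F^2}{\|X\|_F^2} \right)^{-1}.$$

One way to think about $\|(X_{\mathcal{S}})^\perp X\|_F^2$ is in terms of column based matrix reconstruction. If we were
to express each column of $X$ as a linear combination of only $r$ columns of $X$, what is the minimum
reconstruction error (in terms of Frobenius norm) we can achieve? Without the restriction of
choosing only columns, this question becomes easy to answer: it is the sum of all but the largest $r$ eigenvalues
of the Gram matrix, $X^T X$. We formalize this in Claim 20.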
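
To make the column based reconstruction question concrete, the sketch below computes $\|(X_{\mathcal{S}})^\perp X\|_F^2$ for every choice of $r$ columns of a small random matrix and compares it with the sum of all but the largest $r$ eigenvalues of $X^T X$. The matrix, the value of $r$ and the helper name `residual_after_projection` are assumptions of this example; the exhaustive search over column subsets is for illustration only and is not the selection procedure of [GS12b].

```python
import numpy as np
from itertools import combinations

def residual_after_projection(X, cols):
    """||(X_S)^perp X||_F^2: squared error left after projecting every column
    of X onto the span of the columns indexed by `cols`."""
    XS = X[:, list(cols)]
    P = XS @ np.linalg.pinv(XS)          # orthogonal projector onto span(X_S)
    return np.linalg.norm(X - P @ X, "fro") ** 2

rng = np.random.default_rng(2)
X = rng.standard_normal((4, 7))          # stand-in for the matrix of Notation 17
r = 2

# Eigenvalues of the Gram matrix in descending order, and the tail sum that
# lower bounds the error of any rank-r reconstruction.
sigma = np.sort(np.linalg.eigvalsh(X.T @ X))[::-1]
tail = sigma[r:].sum()

# Reconstruction error for every subset of r columns.
errors = {S: residual_after_projection(X, S)
          for S in combinations(range(X.shape[1]), r)}
best_S = min(errors, key=errors.get)
```

Every entry of `errors` is at least `tail`, and the gap between `errors[best_S]` and `tail` is the price of insisting on actual columns of $X$ rather than arbitrary directions.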
Notation 19. Let $\sigma_1 \ge \sigma_2 \ge \ldots \ge \sigma_m \ge 0$ be the eigenvalues of $X^T X$ in descending order.

Claim 20. For any seed set $\mathcal{S} \subseteq \binom{V}{2}$ with $|\mathcal{S}| = r$, $\|(X_{\mathcal{S}})^\perp X\|_F^2 \ge \sum_{j \ge r+1} \sigma_j$.

Proof. Follows from $\mathrm{rank}(X_{\mathcal{S}}) \le |\mathcal{S}| = r$ and the Courant-Fischer Theorem. □

In [GS12b], it was shown that choosing $\approx \frac{r}{\varepsilon}$ many columns suffices to decrease the error within
a $(1 + \varepsilon)$-factor of this lower bound, and this is essentially the best possible up to low order terms.

Theorem 21 ([GS12b]). For any positive integer $r$ and positive real $\varepsilon$, there exist $\left( \frac{r}{\varepsilon} + r - 1 \right)$ columns of
$X$, $\mathcal{S}$, such that $\|(X_{\mathcal{S}})^\perp X\|_F^2 \le (1 + \varepsilon) \sum_{j \ge r+1} \sigma_j$. Furthermore there exists an algorithm to find such $\mathcal{S}$
in time $\mathrm{poly}(n)$ (recall $X$ has $\binom{n}{2} = O(n^2)$ columns).

Our seed selection procedure is presented in Algorithm 2. We bound $\sum_{j \ge r+1} \sigma_j$ in Theorem 22,
whose proof is given in Appendix A. The main approximation algorithm combining Algorithms 1
and 2 is presented in Algorithm 3 with its analysis in Corollary 23.

Theorem 22. Let $0 \le \lambda_1 \le \lambda_2 \le \ldots \le \lambda_m$ be the generalized eigenvalues of Laplacian matrices for the
cost and demand graphs. Then for $X$ being the matrix defined in Notation 17, the following bound holds:

$$\frac{\sum_{j \ge r+1} \sigma_j}{\|X\|_F^2} \le \frac{\Phi^{SDP}}{\lambda_{r+1}}.$$

We put everything together in the following corollary.

Corollary 23. Given $C, D : \binom{V}{2} \to \mathbb{R}_+$ representing cost and demand graphs, let $0 \le \lambda_1 \le \lambda_2 \le \ldots \le \lambda_m$
be their generalized eigenvalues in ascending order. For any positive integer $r$ and real $\varepsilon > 0$, on input
$C$, $D$ and $r' := \frac{r}{\varepsilon} + r - 1$, Algorithm 3 outputs a subset $T \subseteq V$ whose sparsity, $\Phi_T$, is bounded by:

$$\Phi_T \le \Phi^* \left( 1 - (1 + \varepsilon) \frac{\Phi^*}{\lambda_{r+1}} \right)^{-1} \quad \text{provided that } (1 + \varepsilon) \frac{\Phi^*}{\lambda_{r+1}} < 1.$$

Furthermore using the SDP solver from [GS12a], the running time can be decreased to $2^{O(r')} \mathrm{poly}(n)$.

Proof. Follows from Theorems 16, 21 and 22. □
Algorithm 1 $T$ = ROUND($C$, $D$, $x$, $\mathcal{S}$): Seed based rounding in time $2^{O(r')} \mathrm{poly}(n)$. Sparsity of its
output is bounded in Theorem 16.

Input: • $C, D : \binom{V}{2} \to \mathbb{R}_+$; $x \in \mathrm{Lasserre}_{2r'+2}(V)$ and seed set $\mathcal{S} \subseteq \binom{V}{2}$ with $|\mathcal{S}| \le r'$.
Output: • A set $T \subseteq V$ representing an approximation for the SPARSEST CUT problem.

Procedure: 1. $\widetilde{S} \leftarrow \{u \in V \mid \text{exists } v \text{ such that } \{u, v\} \in \mathcal{S}\} \subseteq V$.

2. For each $f : \widetilde{S} \to \{0, 1\}$,

(a) Let $p_f : [n] \to V$ be an ordering of $V$ so that $\langle x_{\widetilde{S}}(f), x_{p_f(1)} \rangle \le \ldots \le \langle x_{\widetilde{S}}(f), x_{p_f(n)} \rangle$.

(b) For each $i \in [n]$, let $T(f, i) \leftarrow \{p_f(1), p_f(2), \ldots, p_f(i)\}$.

3. $T \leftarrow \mathrm{argmin}_{f : \widetilde{S} \to \{0,1\}, \, i \in [n]} \Phi_{T(f,i)}$.

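Algorithm 2 $\mathcal{S}$ = SELECT-SEEDS($D$, $x$): Seed selection in time $\mathrm{poly}(n)$.
Input: • $x \in \mathrm{Lasserre}_{2r'+2}(V)$ and $D : \binom{V}{2} \to \mathbb{R}_+$ as the demand graph.

Output: • $\mathcal{S} \subseteq \binom{V}{2}$ with $|\mathcal{S}| \le r'$ as a set of seed edges.

Procedure: 1. Let $X \leftarrow \left[ \sqrt{D_{u,v}} \, (x_u - x_v) \right]_{\{u,v\} \in \binom{V}{2}}$.

2. Use the algorithm from [GS11] to choose $r'$ columns, $\mathcal{S} \subseteq \binom{V}{2}$, of matrix $X$ and return $\mathcal{S}$.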



Algorithm 3 $T$ = APPROXIMATE-SC($C$, $D$, $r'$): Main algorithm for approximating SPARSEST CUT.
Sparsity of the output is bounded in Corollary 23. A naive implementation will run in time $n^{O(r')}$.
However this algorithm exactly fits into the local rounding framework introduced in [GS12a],
therefore we can use the faster solver from [GS12a] to decrease the running time to $2^{O(r')} \mathrm{poly}(n)$.

Input: • $C, D : \binom{V}{2} \to \mathbb{R}_+$ as the cost and demand graphs, respectively.

Output: • A set $T \subseteq V$ representing an approximation for the SPARSEST CUT problem.

Procedure: 1. Compute a (near-)optimal solution, $x$, to the following SDP:

$$\min \sum_{u<v} C_{u,v} \|x_u - x_v\|^2 \quad \text{s.t.} \quad \sum_{u<v} D_{u,v} \|x_u - x_v\|^2 = 1, \ \|x_\emptyset\|^2 > 0, \ x \in \mathrm{Lasserre}_{2r'+2}(V). \qquad (8)$$

2. Let $\mathcal{S} \leftarrow$ SELECT-SEEDS($D$, $x$) (Algorithm 2).

3. Let $T \leftarrow$ ROUND($C$, $D$, $x$, $\mathcal{S}$) (Algorithm 1). Return $T$.
A Omitted Proofs from Section 3

Lemma 24 (Restatement of Lemma 9). Given a set of vertices $V$, a collection of vectors
$[y_u \in \mathbb{R}^\Upsilon]_{u \in V}$ representing an embedding of $V$, the following holds. For any $C, D : \binom{V}{2} \to \mathbb{R}_+$
being the edge weights of graphs $G$ and $H$, respectively:

$$\min_{f, \tau} \Phi_{T(f,\tau)} \le \frac{\sum_{u<v} C_{u,v} \|y_u - y_v\|_1}{\sum_{u<v} D_{u,v} \|y_u - y_v\|_1}.$$

Here $T(f, \tau) := \{u \in V \mid y_u(f) \ge \tau\}$ represents the threshold cut along coordinate $f \in \Upsilon$.

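Proof of Lemma 9. Let $\phi := \min_{f, \tau} \Phi_{T(f,\tau)}$. For any $f \in \Upsilon$, let
$\delta_f := \max_{a,b} |y_a(f) - y_b(f)| = \max_b y_b(f) - \min_a y_a(f)$ and $\Delta := \sum_f \delta_f$.

Consider the following randomized process. Choose $f \in \Upsilon$ with probability proportional to
$\delta_f$ and then sample a threshold $\tau$ uniformly at random from $[\min_a y_a(f), \max_b y_b(f)]$. Then:

$$\mathbb{E}_{f,\tau}\left[\, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \,\right] = \sum_{f \in \Upsilon} \frac{\delta_f}{\Delta} \cdot \frac{|y_u(f) - y_v(f)|}{\delta_f} = \frac{1}{\Delta} \|y_u - y_v\|_1.$$

Moreover, for any $f$ and $\tau$, by definition of $\phi$:

$$\sum_{u<v} C_{u,v} \, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \ge \phi \sum_{u<v} D_{u,v} \, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)|.$$

Putting it all together:

$$\frac{\sum_{u<v} C_{u,v} \|y_u - y_v\|_1}{\sum_{u<v} D_{u,v} \|y_u - y_v\|_1} = \frac{\mathbb{E}_{f,\tau}\left[ \sum_{u<v} C_{u,v} \, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \right]}{\mathbb{E}_{f,\tau}\left[ \sum_{u<v} D_{u,v} \, |\mathbb{1}_{T(f,\tau)}(u) - \mathbb{1}_{T(f,\tau)}(v)| \right]} \ge \phi. \qquad \square$$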



Theorem 25 (Restatement of Theorem 22). Let $0 \le \lambda_1 \le \lambda_2 \le \ldots \le \lambda_m$ be the generalized eigenvalues
of Laplacian matrices for the cost and demand graphs. Then the following bound holds:

$$\frac{\sum_{j \ge r+1} \sigma_j}{\|X\|_F^2} \le \frac{\Phi^{SDP}}{\lambda_{r+1}}$$

where $X$ is the matrix given in Notation 17.

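Proof of Theorem 22. Since the claimed bound is scale independent, we may assume $\|X\|_F^2 = 1$
without loss of generality.

Throughout the proof, we will use the following matrices:

• $\widehat{X} := [x_u]_{u \in V} \in \mathbb{R}^{\Upsilon, V}$ is the matrix whose columns are the vertex vectors (written $\widehat{X}$ to keep it
apart from the matrix $X$ of Notation 17).

• $B_C \in \mathbb{R}^{\binom{V}{2}, V}$ is the following edge-node incidence matrix of the cost graph whose columns
and rows are associated with vertices and edges, respectively. Its entry at column $c \in V$ and
row $\{a, b\} \in \binom{V}{2}$ with $a < b$ (assuming some consistent ordering of $V$) is given by:

$$(B_C)_{\{a,b\}, c} := \begin{cases} \sqrt{C_{a,b}} & \text{if } c = a, \\ -\sqrt{C_{a,b}} & \text{if } c = b, \\ 0 & \text{else.} \end{cases}$$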
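• $B_D \in \mathbb{R}^{\binom{V}{2}, V}$ is defined similarly for the demand graph.

• $L_C$, $L_D$ are the Laplacian matrices for cost and demand graphs, respectively.

• $(L_D)^\dagger$ is the pseudo-inverse of $L_D$.

The following identities are trivial:

$$X = \widehat{X} (B_D)^T; \qquad L_C = B_C^T B_C; \qquad L_D = B_D^T B_D.$$

Moreover

$$\sum_{u<v} C_{u,v} \|x_u - x_v\|^2 = \|\widehat{X} B_C^T\|_F^2 = \mathrm{Tr}\left( \widehat{X} L_C \widehat{X}^T \right) = \Phi^{SDP} \, \mathrm{Tr}\left( \widehat{X} L_D \widehat{X}^T \right) = \Phi^{SDP}$$

by our assumption that $\|X\|_F^2 = \mathrm{Tr}\left( \widehat{X} L_D \widehat{X}^T \right) = 1$.

Since $(L_D)^\Pi$ is a projection matrix and $L_C \succeq 0$, we have $L_C \succeq (L_D)^\Pi L_C (L_D)^\Pi$. Substituting
the identity $(L_D)^\Pi = L_D (L_D)^\dagger = (L_D)^\dagger L_D$ into this lower bound, we have:

$$L_C \succeq L_D L_D^\dagger L_C L_D^\dagger L_D = (B_D)^T \left[ B_D \, Z \, (B_D)^T \right] B_D, \qquad \text{where } Z := L_D^\dagger L_C L_D^\dagger,$$

$$\Phi^{SDP} = \mathrm{Tr}\left( \widehat{X} L_C \widehat{X}^T \right) \ge \mathrm{Tr}\left\{ \widehat{X} (B_D)^T \left[ B_D Z (B_D)^T \right] B_D \widehat{X}^T \right\} = \mathrm{Tr}\left\{ X \left[ B_D Z (B_D)^T \right] X^T \right\}.$$

The null space of $X = \widehat{X} (B_D)^T$ contains the null space of $(B_D)^T$. In particular, non-zero
eigenvectors of $X^T X$ are contained in the span of $B_D (B_D)^T$. Using the von Neumann-Birkhoff Theorem,
we obtain:

$$\Phi^{SDP} \ge \sum_j \sigma_j \lambda_j \ge \lambda_{r+1} \sum_{j \ge r+1} \sigma_j.$$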

B Using Subspace Enumeration for Uniform Sparsest Cut 

Throughout this section, we will assume that the cost graph with weights $C : \binom{V}{2} \to \mathbb{R}_+$ is
1-regular. Since $G$ is regular, the definitions of uniform sparsest cut / normalized cut and edge
expansion / conductance coincide. Thus we will focus only on UNIFORM SPARSEST CUT, whose value we
denote by $\phi^*$.

The following theorem is adapted from [AL08] for our setting: 

Theorem 26 (Cut Improvement, see [AL08]). For any $x^* \in \{0, 1\}^V$, given $x \in \{0, 1\}^V$ satisfying

$$0 < \|x\|_1 \le \frac{n}{2} \quad \text{and} \quad \frac{\langle x, x^* \rangle}{\|x^*\|_1} > \frac{\|x\|_1}{n},$$

in polynomial time one can find $y \in \{0, 1\}^V$ whose edge expansion is within a factor

$$\frac{1 - \|x\|_1 / n}{\langle x, x^* \rangle / \|x^*\|_1 - \|x\|_1 / n}$$

of $x^*$'s edge expansion.
The following theorem is adapted from [ABS10]:

Theorem 27 (Eigenspace Enumeration, see [ABS10]). There exists an algorithm which, in time $2^{O(r)} n^{O(1)}$,
outputs a set $X \subseteq \{0, 1\}^V$ that contains some $x \in X$ with the following property: There exists $x^* \in
\{0, 1\}^V$ with:

\\x - a:* ||i < _8_ 
ii * 1 1 — \ 

\\X ||l A T 

Combining these two, we obtain the following: 

Corollary 28. For any positive integer $r$, if the $r$'th smallest eigenvalue of the Laplacian matrix for the cost graph
satisfies $\lambda_r > 8\phi^*$ where $\phi^*$ is the UNIFORM SPARSEST CUT value, then in time $n^{O(1)} 2^{O(r)}$ one can find
$y \in \{0, 1\}^V$ whose uniform sparsity is bounded by:

2(f)* 